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SPECTRAL GAP FOR THE ZERO RANGE PROCESS 
WITH CONSTANT RATE 

By Ben Morris 

University of California, Davis 

We solve an open problem concerning the relaxation time (inverse 
spectral gap) of the zero range process in Z d /LZ d with constant rate, 
proving a tight upper bound of 0((p+ 1) 2 L 2 ), where p is the density 
of particles. 

1. Introduction. 

1.1. Basic definitions. Letting G = (V, E) be a finite, connected, regular 
graph, we define a configuration as follows. In a configuration, a total of 
r indistinguishable particles are distributed over the vertices in V . For a 
configuration rj, we denote by rj(v) the number of particles in vertex v [so 
that J2v£V n ( v ) = r ]- We define the density of particles as p = r/\V\ . 

The zero range process with rate A is the following continuous-time Markov 
process on configurations. Suppose that the current state is rj. For each 
vertex v at rate A, if rj(v) > 0, we expel a particle from v to a random 
neighbor, that is, decrease rj(v) by one, choose a neighbor w of v uniformly 
at random (u.a.r.) and increase rj(w) by one. 

Note that since the process is irreducible and has symmetric transition 
rates, the distribution at time t converges to uniform as t — > oo. Let C denote 
the space of configurations, and for probability distributions fj,, v on C, let 

||/i-i/||=max/z(Q)-i/(Q)= mm P(X^F) 

be the total variation distance. The spectral gap, defined as the absolute value 
of the second largest eigenvalue of the generator of the process, governs the 
asymptotic rate of convergence to the stationary distribution (see, e.g., [6]). 
More specifically, we have 

(1) gap = min lim log II K l (x, •) — U\\, 

x t— too t 
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where K is the transition kernel for the zero range process and U is the 
uniform distribution over configurations. We define the relaxation time to 
be 

Trclax = 1/gap = max lim -t/log \\K t (x, ■) - U\\. 

x t— >oo 

Thus, the relaxation time is the smallest value of r such that for every 
starting configuration x, there is a constant C such that \\K t (x,-) —U\\ < 
Ce~ t / T for all t > 0. 

This paper is concerned with bounding the relaxation time in the impor- 
tant special case where G is the d-dimensional torus Z d /LZ d . Note that the 
spectral gap is proportional to the rate A, so we can assume without loss of 
generality that A = 1. We will call this process the ZRP on G. We will take 
the dimension d to be arbitrary but fixed and bound the relaxation time as 
a function of p and L. 

1.2. Background, motivation and summary of results. The zero range 
process is a widely studied Markov chain in statistical mechanics. In the 
general zero range model, there is a rate function c(k) that gives the rate 
at which a site occupied by k particles expels one [e.g., the assumption of 
independent random walks corresponds to c{k) = k]. In our model, we have 
c(k) = 1 for all k, so the rate at which a vertex expels a particle does not 
depend on the number of particles there. In the language of queuing theory, 
this is a closed queuing network with i.i.d. exponential service times. 

A key estimate needed for hydrodynamic limits is a sharp upper bound on 
the relaxation time (see [11]). Landim, Sethuraman and Varadhan [5] used 
the techniques of Lu and Yau [7] to prove that under certain conditions on 
the rate function c(-), the relaxation time is 0(L 2 ); see also [1]. However, 
their conditions did not apply to a bounded rate function since they required 
that there be a constant 7 > such that c(k) > ^k for sufficiently large k. 
Indeed, as they observe (see Example 1.1 of their paper), their result does 
not apply to the case of a constant rate since their bound holds uniformly 
in p, but the relaxation time is of order (1 + p) 2 L 2 when the rate is constant 
and d = 1. In fact, a lower bound of order (1 + p) 2 L 2 holds for all d; this can 
be seen by substituting the test function f{rf) = J2vev v( v ) cos ( 2! i 1 ) into the 
variational definition of spectral gap (see, e.g., [4]). This type of function 
was used by David Wilson in [12]. 

While we cannot expect an 0(L 2 ) bound to hold uniformly in p, it is 
natural to ask whether such a bound holds if we incorporate the extra factor 
(p+ l) 2 . In this paper, we prove that this is indeed the case. 

We will study the ZRP on the torus Z d /LZ rf , but our proof is easily 
extended to variants such as the box {0, 1, . . . , L} d . Indeed, our proof shows 
that for an arbitrary regular graph G, the bound on the spectral gap for 
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random walk on G given by the paths technique (see [3, 9]) also applies, up 
to constant factors, to the ZRP on G. Similar results were already known 
for the exclusion process; see [2]. 

2. Comparison technique. It turns out that the ZRP is easier to analyze 
on the complete graph than on the torus. Fortunately, comparison techniques 
(see, e.g., [2]) allow us to reduce to this case. Let G\ = Z d /LZ d and let G2 be 
the complete graph on L d vertices (not including self- loops). Let Vi denote 
the set of configurations of the ZRP on G\, let £\ = {(r},rf) :rf,rf 6 Vi and 
77 — ► 7/ is a transition in the ZRP on G7i} and let Q\ be the graph (Vi,£i). 
Note that the ZRP on G\ can be described as the continuous-time random 
walk on Q\ in which edges are crossed at rate l/2d. Similarly, if we define 
V2,<?2 and Q2 analogously (with G2 replacing G±), then the ZRP on G2 is 
the random walk on Q2 in which edges are crossed at rate l/(L d — 1). Fairly 
standard techniques allow us to estimate the relaxation time for the ZRP 
on G\ by comparing it with the ZRP on G2. More specifically, we will set 
up a suitable multicommodity flow on Q\ in which we route one unit of flow 
from t] to 7/ for each (77,7/) G £2 simultaneously. For any such flow / and 
any edge e in £\, let /(e) denote the total flow along e; that is, /(e) is the 
sum over all ordered pairs (r],rj') £ £2 of the 77 — ^ 77' flow carried by e. Define 
the congestion C(f) = L }_i max e /(e), that is, the maximum flow along any 

edge normalized by L d — 1, and the length £(/) to be the length of a longest 
flow-carrying path. The following theorem is a special case of results in [2] 
(see also [10]): 

Theorem 1 ([2]). Let t\ and T2 denote the relaxation times for the 
ZRP' s on Gi and G2, respectively. For any flow f , we have T\ < 2dC(f)C(f)T2 

We will bound t\ by constructing a flow / with congestion C(f) < L and 
length C(f) < dL. By Theorem 1, this implies t\ < 2d 2 L 2 T2- We will now 
describe the flow /. Let (77,7/) be an edge in £2- Then the configurations 77 
and rj differ at only two vertices u and v , with, say, rj(u) = rj'(u) + 1 and 
r]{v) = rf{v) — 1. For a vertex w, define Xw to be the configuration that has 
just a single particle, located at w. Define 77 At/ as the vertex-wise minimum 
of rj and 7/ , so that 77 A 7/ = r\ — \ u = r/ — Xv We will use i] — > 7/ flows that 
pass only through configurations of the form 7] A 7/' + Xw', that is, the flow- 
carrying paths send the "extra particle" at u to v along a path in G\. To 
specify such a flow, it is enough to specify a flow g uv from u to v in G\. We 
will use the one that simply spreads flow uniformly over all shortest paths 
from u to v. Note that if (C1X2) is an edge in a shortest path from rj to 7/ , 
then Ci A C2 = ?? A r{ . 
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Fix an edge e = (Ci? C2) G £i- We must bound the total flow through e. Let 
( = (j A £2. Then all pairs (77, 77') such that some 77 — ► 77' flow passes through 
e must satisfy 77 A 77' = £. For vertices ttj, define = C + Xw Then 

/( e ) =2/c",c( e ) 

But Lemma 2 below says that J2 U v 9uv(e) < L(L d - 1). It follows that C{f) < 
L. Finally, note that the flow / also satisfies £(/) < dL. Hence, Theorem 1 
implies that t\ < 2d 2 L 2 T2- 

The following lemma was used in the estimate for C(f): 

Lemma 2. For vertices u,v E Z d /LZi d , let g uv be the flow that spreads 
flow evenly among all shortest paths from u to v. Then, for any edge e, 

Y,9uv(e)<L(L d -l). 

u,v 

Proof. Define g = J2 uv g U v By symmetry, the quantity g(e) does not 
depend on e. But the maximum length of any shortest path is less than dL, 
the number of ordered pairs of distinct vertices is L d (L d — 1) and the total 

number of edges is dL d . Hence e carries at most = L(L d — 1) 

units of flow. □ 

3. The ZRP on the complete graph. In the previous section we showed 
that n < 2d 2 L 2 r 2 . Thus, to prove an 0{(p + 1) 2 L 2 ) bound for n, we need 
only show that the relaxation time T2 for the ZRP on the complete graph 
can be bounded by 0((p + l) 2 ), uniformly in the number of vertices. 

Theorem 3. Let K n denote the complete graph on n vertices. Fix p > 
and let r be the relaxation time for the ZRP on K n with a density p of 
particles. Then r < C(p+ l) 2 for a universal constant C. 

Proof. We will estimate r using coupling; that is, we construct a pro- 
cess ((rjt,r]' t ),t > 0) such that (r] t ,t > 0) and (r}' t ,t > 0) are each ZRP's and 
the distribution of r/' is uniform. Let T = inf{i : r]t = n' t } be the coupling 
time. It is a standard fact that 

(2) 11^(77, ■)-w||<p f7 (r>t), 

where we write P^( • ) := P( • 1 770 = 77). The following lemma relates the re- 
laxation time r to the tail of the distribution of T: 
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Lemma 4. Fix 7 > and suppose that E( e T T ) < 00. Then r < 1/7. 



PROOF. If E(e^ T ) < 00, then lim*^ e?*P(T > t) = 0. Hence P(T > t) < 
e~ 7 * for sufficiently large t. Combining this with equation (2) gives r < I/7. 



Zero range process with ranked particles. We will actually study a process 
with labeled particles. Suppose that the particles are labeled 1, ...,r and 
each particle x has a rank Rt(x) £ {1, . . . , r} at time t. Every particle has a 
unique rank, so Rt(-) is a permutation. We assume that when vertex u rings, 
the highest ranking particle at u (i.e., the particle whose rank is the smallest 
number) is expelled. Let vt(i) denote the location of the particle of rank i at 
time t. Then rjt(u) = Ya=i l( v t(i) = u), where we write 1(A) for the indicator 
of the event A. For positive integers j, we define r]t(u,j) = J2l=i l( v t(i) = u). 
We will use the notation %(•,•) to denote a ZRP with ranked particles. 

We allow the rankings to vary in time. However, we assume that the 
vertices ring independently of the rankings. Thus, if the particles of rank 
1, . . . , j maintain a constant rank [or even if the set {x : 1 < R(x) < j} does 
not change], then the process (rjt(-,j) - t>0) behaves like the ZRP with j 
particles. 

The coupling. We now describe the coupling. The coupling will consist 
of r stages, going from Stage to Stage r — 1. At the end of Stage j, 
configurations rjt and rj t will satisfy 77^ (- , j + 1) = rf t (-, j + 1). Thus, we will 
have rjt = n' t at the end of Stage r — 1. We will now describe Stage r — 1. 
(Earlier stages will be similar.) The stage consists of two steps: 

• Step 1. First, we rank the particles so that the location of each particle 
whose rank is less than r is the same in n as it is in ?/. Throughout Step 
1, we couple the processes so that corresponding vertices ring at the same 
time, and when particles are expelled, they choose the same destination. 
This ensures that the locations of the particles of rank 1, 2, . . . , r — 1 remain 
matched. We run the first step until 



that is, the number of particles in the same location as the particle of rank 
r is the same in n as it is in 77'. Let a = vt(r) and b = v' t (r). Note that 



□ 



Vt(v t (r),r - I) = n' t (v' t (r),r 



1) 



(3) 



Vt(a) = r]' t (b) 



and 



(4) 



Vt{b)=v't(a) 



at the completion of Step 1. 
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• Step 2. In Step 2, we couple so that vertex a in r/ plays the role of vertex 
b in rj' , and vice versa. This is accomplished as follows. First, the particles 
located at vertices a and b are given new rankings so that the particles of 
vertex a (resp. b) in i] correspond to the particles of vertex b (resp. a) in 
if. Second, we couple so that vertex a (resp. b) rings in the process rjt if 
and only if b (resp. a) rings in the process rf t . If an expelled particle moves 
to a (resp. b) in rjt, then the corresponding particle moves to b (resp. a) 
in r)' t , and so on. This ensures that (3) and (4) persist throughout the 
duration of Step 2. We run Step 2 until rj t (a) = rj t {b). Note that this 
implies rj t (-,r) = r)' t (-,r), and we are done. 

Stage j for 1 < j < r — 1 can be described similarly, except that in Stage j, 
only the trajectories of particles 1, . . . , j + 1 are coupled. Particles j + 2, . . . , r 
are allowed to move in an arbitrary way. Note that since particles j + 2, . . . , r 
move only when there are no higher ranking particles in the same location, 
they do not interfere with the trajectories of particles 1, . . . , j + 1, so Stage j 
can proceed in similar fashion to Stage r — 1. Stage j finishes when rj(-,j + 
l)=r/(-,j + l). 

Let Ct be an encoding of the coupled pair of labeled particle processes and 
the rankings at time t, and let stage f and step i denote the stage and step, 
respectively, in progress at time t. Then the process ((Ct, stage t , step t ) : t > 0) 
is a Markov chain. We wish to show that for a universal constant C > 0, 
the time r to complete stages 1 through r — 1 satisfies E(e 7T ) < oo for 
7> 1/C(p + 1) 2 . The following lemma says that it is enough to show that for 
any particular step of any stage, the time T spent there satisfies E(e 7T ) < oo 
for 7 > l/CV+l) 2 : 

Lemma 5. Let Xt be a finite- state Markov chain with k transient classes 
Si,S2, ■ • ■ , Sk, and suppose that there are no transitions from Sj to Sj-i for 
j > 2. Define Tj = inf{t > : Xt ^ Si U • • • U Sj}. Fix 7 > and suppose that 
for all j and x G Sj, we have E x (e 7Tj ) < 00. Then E(e 7Tfc ) < 00, irrespective 
of the distribution of Xq. 

Proof. Define T = 0, and for j G {1, . . . ,k}, let Wj = Tj - Tj_ x . Then 
T k = T,j =1 Wj. So E(e 7T ^) = E(e 7 W+'"+^)). Let aj = max x€Sj E x (e^). 

We show by mathematical induction that E(e 7 ^ 1_l ^a)) < nj = i Q j f° r an 
/ < k. The base case / = 1 follows from the definition of u\. Now fix I < k 
and suppose that E(e^ Wl+ - +Wl) ) < U l j=i a j- Let Y be the first state that 
Xt visits after leaving Si. Then, 

E ( e 7(Wi+.«+Wi+i)) = E ( E ( e 7(Wi+---+w i+1 )| Wi; _ _ _ ^ Wi,Y)) 

= E(e 7{w/l+ - +Wi) E(e 7Wi + 1 1 Wi, . . . , W h Y)) 
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= a i+ iE(e^ 1+ - w ')), 
and the result follows. □ 

To complete the proof of Theorem 3, we need to show that the time T 
spent performing a particular step satisfies E(e 7T ) < oo for 7 > 1/C(p + l) 2 . 
Lemmas 9 and 10 below do this for steps of type 2 and 1, respectively. □ 

We will need the following lemma: 

Lemma 6. Fix v G V and suppose that rjo(v) < 2(p + 1). Let Z = \{t < 
(p + l) 2 :r) v (t) = 0}|. Then, for universal constants C > and M > 0, we 
have E(ZAAf(p + l)) >C(p+l). 

Proof. Note that the ZRP on K n can be described in the following 
way. For each ordered pair of distinct vertices (u,v) at rate l/(n — 1), the 
process attempts to move a particle from u to v ; that is, the move is made 
unless u is empty. Thus, for any vertex v, the total rate at which attempts to 
increase rjt(v) are made is 1, and this is also the rate of attempts to decrease 
rjt(v). Furthermore, when rjt(v) > it is always possible to decrease rft(v), 
so rjtiy) — ► r]t(v) — 1 at rate 1 in this case. Let i = rjo(v) and let X t be the 
continuous-time simple symmetric random walk on the integers, started at 
i, that moves in each direction at rate 1. 

One can easily couple the processes rjt(v) and X t so that rjt(v) = when- 
ever Xt = 0. Thus, it is enough to show that for constants C > and M > 0, 
we have E(Y AM(p + l)) > C(p+ 1), where Y = \{t < (p + l) 2 : X t = 0}|. 

We will first show that E(Y) > D{p + 1) for a universal constant D by 
showing that if t satisfies \{p+ l) 2 < t < (p+ l) 2 , then P(X t = 0) > 2D(p + 

I)" 1 - 

Note that Xt — i is distributed as the difference of two independent 
Poisson(i) random variables. Hence, Xt — i is approximately normal with 
mean and variance 2t when t is large. More precisely, the local limit theo- 
rem implies that there is a universal constant D such that for all t > | and 
j with |j| < \/8t, we have 

2D 

(5) P(X t -i=j)>— . 

Recall that i < 2{p + 1) and note that if t G [\{p + l) 2 , (p + l) 2 ], then y/t < 
(p + 1) < v2t- Hence i < \/8t. Thus, we can substitute j = —i into (5) and 
obtain 



P(X t = 0) > 2^/^ > 2 J D/(p + 1). 
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Next, we will show that Y is unlikely to be a very large multiple of (p + 1). 
Let Ti = inf {t : X t = 0} and for k > 2, let T k = inf{t > T k _ x + 1 : X t = 0}. For 
k > 1, let A fe be the event that X* / for all f £ [T fe + 1, T fc + (p + l) 2 ] . 

Proposition A. 2 in the Appendix gives 

P (X t ^ for all t £ [1, (p + l) 2 ]) > 6(p + l)" 1 , 

for a universal constant b. Combining this with the strong Markov property 
gives 

P{A k \Al...Al^)>b{p+l)-\ 

and hence P{Al---A%) < (I - b(p + l)~ x ) k . 

Recall that Y is the amount of time Xt spends at up to time (p + 
l) 2 . Note that Y < N, where N = m.i{k:T k > (p + l) 2 }. Hence, for every 
nonnegative integer k, we have 

P(Y> k) <P(iV >k) 

<V(A\...A%) 

<(l-6(p + l)- 1 ) fc . 

Let M = b~ 1 \og{2/Db). Note that 

Y = (Y AM(p+l)) + W, 

where W = (Y — M(p + 1))1(Y > M(p + 1)). Since E(Y) > D(p + 1), it 
follows that 

(6) E(y AM(p + l))>D(p+l)-E(W). 

But 

E(W) < Y, F (W>k) 

k>0 
k>0 

<^(l-6(p + l)- 1 ) M ^ +fe . 

fc>0 

By summing the geometric series and then using the fact that 1 + u < e u for 
all u, it is easily shown that E(W) < e~ bM b~ 1 (p+ 1) = + 1). Combining 
this with (6) gives E(Y AM(p + l)) > f (p+1). □ 

The following lemma is a consequence of Lemma 6. It says that when t is 
large, the average fraction of time the vertices spend empty is Q(l/(p + 1)), 
with high probability. For v £ V, define 

Z t (v) = \{s<t: Vs (v)=0}\ 

and let Z t = \ J2vev z t{v). 
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Lemma 7. There exists a universal constant a > such that for every 
t> (p + l) 2 , we have 

where C is the constant appearing in Lemma 6. 

Proof. Let M be the constant appearing in Lemma 6. For k E {1, 2, . . .}, 

let 

X k (v) = \{(k- l)(p + l) 2 < s < k{p + l) 2 : n,(v) = 0}| A M{p + 1) 

and let X k = = J2vev Xk {v) • Markov's inequality implies that at least half of 
the vertices v € V satisfy r]o(v) < 2(p+ 1). For such vertices v, Lemma 6 says 
that Epfi(u)) > C(p+ 1). Hence E(Xi) > \C(p+l). A similar argument 
applies to X k , for every positive integer k. It follows that if we define S k = 
J2i=i(Xi — \C{p + 1)), then {S^ :k>0) is a submartingale. Recall Azuma's 
inequality (see Corollary 6.9 on page 166 and Section (c) on page 168 of [8]): 
if M k is a submartingale with Mq = and \Mk + \ — Mk\ < B for all k, then 
for every A > and k > 1, we have 

p(mmM J <-A^<e~ bA2 / kB \ 

where b > is a universal constant. (In [8], Azuma's inequality is stated for 
martingales, but since any submartingale can be written as a martingale 
plus a nonnegative, predictable sequence, clearly the result must hold for 
submartingales as well.) Note that {Sk} satisfies \Sk+i — Sk\ < (M + ^C)(p + 
1). Hence, Azuma's inequality implies that 

V(S k <-A)<e- b ' A2 ' k ^\ 

for a universal constant b' > 0. 
Hence, 

P ^ kC ^ P + 1} ) = P(5fc " - l ± kC{p + l)) ~ e~ b ' C2k/16 - 

Note that Z fc(p+1)2 > X^ =1 Xj. Thus, for t = k(p+ l) 2 and a = b'C 2 /16, we 
have 

e> ^<-^)<-^<-^)<-^'- 

Thus the lemma holds for t of the form k(p + l) 2 . But since Zt is nonde- 
creasing in t, we can make (7) true for all t > (p + l) 2 by incorporating an 
extra factor of \ into a. □ 

We also use the following proposition: 
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Proposition 8. There is a universal constant A > such that if X 
and Y are independent PoissonfAJ random variables for sufficiently large X, 
then: 

(a) P(|X — A| > A/2) < e~ AX ; 

(b) P(X -Y>aX)< e~ Aa2x for every a £ [0, 1] . 

Proof. We prove (b) first. The moment generating function of X is 
0x(#)^E(e ex )=exp(A(e e -l)). 
It follows that the moment generating function of X — Y is 

0*-y(0) = M0)<M-0) 

= exp(A(e e -l) + A(e- e -l)) 
= exp(A(# 2 +o(# 2 )), 
since e u = 1 + u + \u 2 + o(it 2 ). Hence, 

P{X -Y>aX) = P(e^ x " y ) > e 6aX ) 
<e" toA fe-y(«) 

= e -eaA e A(0 2 +o(0 2 )) ) 

by Markov's inequality. Taking logarithms, we get 

log(P(AT — Y> aX)) < X(9 2 -9a + o{9 2 )). 
When 9 = a/2, this becomes 

A(^-y+o(a 2 ))=A(-ia 2 + o(a 2 )). 

Hence P(X — Y > aX) < eT Ac ^ x for a universal constant A and (b) is proved. 
For (a), note that 

(f>x-x(0) = exp(A(e 9 - 9 - 1)); <t>x-x(0) = exp(A(e^ + 9-1)). 
Using the inequality e'"' < e u + e~ u and setting 9 = \ gives 

<£|X-A|(§) < exp(A(e 1 / 2 - I - 1)) + exp^e" 1 / 2 + 1-1)) 

<2e A / 10 , 

since e u — u — 1 < 1/10 whenever Applying Markov's inequality 

to the random variable e^ x ~ x ^ 2 gives 

P(\X - A| > A/2) < e" A / 4 • 2e A / 10 = 2e" 3A / 20 , 

which is at most e _A//1 ° when A is sufficiently large. □ 

We will now use Lemma 7 to prove Lemmas 9 and 10. It turns out that 
steps of type 2 are easier to analyze, so we start with the following: 
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Lemma 9. Fix j 6 {0, . . . , r — 1} and let T be the time spent performing 
Step 2 of Stage j . Then E(e 7T ) < oo for 7 > D(p + 1)~ 2 , where D is a 
universal constant. 

Proof. We may assume that r = j + 1, since if there are particles that 
rank lower than j + 1, they do not affect the behavior of the particles ranking 
1, . . . , j + 1 and they increase the value of p. 

Let vertices a and b be as defined in the description of the coupling and 
suppose that r] t (a) > rjt(b) at the beginning of Step 2. Step 2 finishes when 
r]t{a) = rjt(b), so Step 2 has been completed by the time that r] t (a) = 0. Thus, 
it is enough to show that for any vertex v, if we define r = ini{t : r]t(v) = 0}, 
then E(e 7r ) < 00 for 7 > D(p + 1)~ 2 . 

Fix a vertex v, and for t > 0, let Xt be the number of attempts to increase 
the occupancy of v minus the number of attempts to decrease the occupancy 
of v, up to time t. Let Bt be the number of attempts to increase the oc- 
cupancy of v, up to time t, that fail because the potential source vertex is 
empty. Note that before time r, all attempts to decrease the occupancy of 
v will succeed. Therefore, 

Vt(. v ) =Vo(v)+X t - B u 

up to time r. It follows that P(r > t) < P(X t — B t > -770 («)). 

We assume that even when a vertex is empty, it attempts to move a 
particle to v at rate > ^. [We will use the convention that when v is 
empty, it attempts to move a particle to itself, and thereby increases B t , 
at rate l/(n — 1).] It follows that, given Zt, the conditional distribution of 
B t stochastically dominates a Poisson(Z i ) random variable. (Recall that the 
sum of the amounts of time the vertices have been empty up to time t is 
Zt = nZt-) Thus, for t sufficiently large, we have 

P (V « )< P ( B ,< « z t > « ) 

^ 8(p+l)/ _ V 8(p+l) - 4(^+1)^ 

(9) < e -AtC/4(p+l) + e -at/(p+l) 2 

where the second inequality uses part (a) of Proposition 8 (for the first term) 
and Lemma 7 (for the second term). It follows that for sufficiently large t, 
we have 

/ Ct \ 

(10) P[B t <— -—) <exp{-ct/(p + l) 2 }, 

for a universal constant c. Since Xt is the difference of two independent 
Poisson(t) random variables, we can apply part (b) of Proposition 8 to Xt 
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and get 

(11) p(x t > 16( ^ 1} ) <exp{-ylC 2 t/256(p + l) 2 }. 

Equations (10) and (11) together imply that for some constant 5 > 0, we 
have 

P(* ~Bt> -^^y) <exp{-5t/(P + I) 2 }, 

for sufficiently large t, and so 

P(T>t)<P(X t -B t >-r ]0 (v)) 
<exp{-<ft/(p + l) 2 }, 
for sufficiently large t [e.g., for i such that Ct/16(p + 1) > r]. It follows that 

E ( e «5T/2(p+l)2) <QO n 

Next, we consider steps of type 1. 

Lemma 10. Fix j G {0, . . . , r — 1} and let T be the time spent performing 
Step 1 of Stage j of the coupling. Then E(e 7T ) < oo for 7 > D(p + 1)~ 2 , 
where D is a universal constant. 

Proof. Again we may assume that r = j + 1. In order to analyze Step 
1, we will embed a coupling into a single process as follows. Consider a zero 
range process Tjt(-,-) with j + 2 ranked particles, and for t > 0, define £t and 
£by 

6 = + x«t(j+i)' $ = % (•'•?') + x«t0'+2)- 

So ^ and ^ are obtained from rj t {-, •) by deleting the particles of rank j + 2 
and j + 1, respectively. Let 770 (-, •) be defined so that (£oj £0) reflects the state 
of the coupling at the beginning of Step 1 of Stage j. Then the time T to 
complete Step 1 has the same distribution as W, where W = inf {t : r\ t (vt (j + 
+ 2) = 7] t (vt(j + 2),j + 2)}, because up to time W, the process (£t,£^) 
behaves exactly like the coupling during Step 1. [Note that the particle of 
rank j + 1 cannot interfere with the particle of rank j + 2 because W < 
mf{t:vt(j + l)=v t (j + 2)}.} 

We wish to show that E(e 7H/ ) is finite for 7 = D(p+ 1)~ 2 . Consider the 
process 't > 0), where for i > 0, we define Q = (vt(')i v t(j + 1)> ^t(j + 2)). 
[Here, we write rjt( m ) f° r the function that outputs the number of particles at 
each site, but contains no information about their rankings. So Q encodes 
only the number of particles at each vertex at time t and the positions of 
the two lowest ranking particles.] The particles of rank l,2,...,j will be 
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called high ranking; note that in the process the high ranking particles 
are indistinguishable. Furthermore, {Q ■ t > 0} is a Markov chain. 

It will be convenient to study a modified version of the process obtained 
by combining all of the states of the form (77, u, u) into a single state, which 
we denote by b. (Note that since the chain does not reach b before time 
W, making such a modification does not affect the distribution of W .) We 
shall also choose transition rates q b ^ x out of b that ensure a "nice" stationary 
distribution. (Again, this does not affect W .) For all x = (77, s, t) with s ^ t, 
let 



1 



(12) 



n — 1 
2 



1' 



if r](s) = r)(t) = 1, 
otherwise. 



Recall that if ir is a positive function on the state space that solves the 
balance equations 

y y 

for every x, then tt can be normalized to be the stationary distribution. [In 
fact, it is enough to verify (13) for all but one of the states x.] We claim 
that if 7r is defined by 7r(6) = 1 and 

(14) 7r(r],s,t) = r](s)r]{t) 

when s ^ t, then it solves (13). For all x, define Q\ n {x) = J2y 7T (y)ly,x and 
Qout(x) = ^2yK(x)q x ^y, so that (13) is equivalent to Q\ n {x) = Q OVl t{x) for all 
x. We will verify this for all x^b. For configurations x and vertices u and v, 
define Qout(x;{u,v}) = ~^2y^{x)q x ^ y , where the sum is over configurations y 
obtained from x by either moving a particle from u to v or vice versa, with 
a similar definition for Q[ n (x;{u,v}). Note that 

(15) Qout(x) = Qout(x;{u,v}), 

{u,v} 

(16) Qin(x) = Qin{x\{u,v}) + q b , x - 

{u,v} 

Fix x = (n, s, t) with s 7^ t. In order to calculate the values of Q Q ut(x, {u, v}), 
we shall consider three cases separately: 

Case {u,v} n {s,t} = 0. In this case, it is easy to verify that 
Qout(x, {u, v}) = Q in (x, {u, v}) 



(5(u)+5(v))- 



1, if r,(w) > 1, 
0, otherwise. 



14 B. MORRIS 

where for vertices w, we define 

S(w) ■ 

Case u = s,v ^ t. In this case, we have 

Q out (x,{u,v}) = (l + S(v)) 1 ^P-. 

n — 1 

If r](v) = 0, then 

w ,, M) .!!(!hM + iW 



n — 1 7i — l 

= V(s)y(t) 
n — 1 

If > 1, then 

(t7( S ) - l)7?(t) , (77(s) + l)77(t) 



n — 1 n — 1 

2r/(s)77(t) 



n — 1 

In both cases, Q ut(^ 5 {«) v}) = Qi n (x,{u,v}). 

Case u = s,v = t. This is the only case where Q ou t(x, {u, v}) ^ Qm(x, {u, v}). 
We have 

if rj(s) = T](t) = 1, 



Q out (x,{u,v}) = < 2 V (s) V (t) 



n — 1 



otherwise. 



If r?(s) > 2 and r,(t) > 2, then 

q^,, {„,„},= W.) - DM0 + 1) + W.) + i)W)-i) 

n — 1 n — 1 

= 27/(g)//(t) - 2 
n — 1 

If r?(s) = 1 and rj(t) > 2, then 

2fo(*) ~ 1) 
n — 1 

and if r/(s) = 77(f) = 1, then Qi n (^, {u, w}) = 0. It follows that 

1 



Qont(x,{u,v}) - Q in (x,{u,v}) 



Q in (x,{u,v}) 

ib — ± 
v})=0. It foil 

if 7,(8) = T](t) = l, 



n — 1 ' 

, otherwise. 

n—1 
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Putting all of this together with equations (12), (15) and (16) verifies the 
balance equations. 

Since the stationary distribution puts positive mass on every state, it is 
enough to show that E(e 7H/ ) is finite if the chain starts in its stationary 
distribution. 

Let qij be the rate at which Ct goes from i to j. The time reversal Ct 
of Ct is the process that starts in distribution ir and has transition rates q 
given by ir(i)qij = n(j)qj t i. For events A, we will write P(^4) to denote the 
probability of A when we run Q instead of Q, with a similar notation for 
expectation. Let B be the set of states (t],u,w) of Q satisfying 7](u) = r](w), 
including the amalgamated state b. Then 

P(W > t') = P(Ct i B for all t G [0,t']) 

= P(Ct i B for all t G [0,t']) = P(W > t'). 

Thus, it is enough to show that E(e 7M/ ) < oo. Furthermore, we may mod- 
ify the process (t so that all transitions to b are suppressed, because this 
can only increase W (since b G B). If we make this modification, it is then 
straightforward to verify, using (14), that the transition rule for the result- 
ing process can be described as follows. Suppose that the current state is 
(?7,Mi,U2) with u\ U2- Then, for every ordered pair of distinct vertices 
(v,w) at rate ( 7nfejjT}+^ ) x 7rh"> an attempt is made to move a particle 
from v to w, where an attempt proceeds as follows. If v is empty, or the 
transition would put the low ranking particles in the same location, then 
the attempt fails. Otherwise, a particle is chosen "uniformly" from v [i.e., 
a high ranking particle is chosen with probability ^j+ 2 ) ano - a ^ ow ran ki n g 
particle is chosen with the remaining probability] and moved to w if the 
reversed move is a transition of Q (i- e -, the high ranking particles are always 
allowed to move and the low ranking particles are only allowed to move if 
the destination vertex is empty). 

Note that Q is somewhat similar to Q , except for the motions of the low 
ranking particles. In Q, the low ranking particles occasionally jump from 
an empty vertex to a nonempty one, whereas in Q they occasionally jump 
from a nonempty vertex to an empty one. The high ranking particles move 
nearly the same way in both processes, except that in Q they have a slight 
preference for moving to vertices that contain a low ranking particle. 

The reader might wonder why we are studying the time reversal Q instead 
of Ct itself. The reason is that in Q, since the occupancies of the vertices con- 
taining j + 1 and j + 2, respectively, can make big positive jumps, it is harder 
to rule out the possibility that it takes a long time for these occupancies to 
be the same. 
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Up to time W, the rate at which any nonempty vertex v expels a particle 
is at least 1 — ^ ^+2) ' ano - ^he ra ^ e a ^ wri ich the process attempts to move 
a particle to v is at most 1 + ^- * +2 ^ , Let 

(17) M t = max(^(utO' + 1)), + 2))), 

that is, is the maximum number of particles in a vertex with a low 
ranking particle. Before time W, we have rj t (vt(j + 1)) / rjt(vt(j + 2)). Hence, 
up to time W, the rate at which Mt decreases when Mt = x (i.e., the rate at 
which the process moves to a new state that decreases Mt) is at least 1 — - , 
and the rate at which the process attempts to increase Mt [i.e., the rate at 
which the process attempts to move a particle to the vertex that achieves 
the maximum in (17)] is at most 1 + ^ Let B t be the number of attempts 
to increase M up to time t that fail because the potential source vertex is 
empty. Note that when a vertex is empty, it attempts to move a particle 
to the vertex that achieves the maximum in (17) at rate at least l/(n — 1). 
It follows that, given Zt, the conditional distribution of Bt stochastically 
dominates a Poisson(Zt) random variable. Furthermore, since Zt is in the 
cr-field generated by the amounts of time the process spends in each of its 
states up to time t, Lemma A.l in the Appendix implies that the conclusion 
of Lemma 7 holds with P replacing P. (Formally, Lemmas A.l and 7 are not 
enough, because we altered the process C by constructing the amalgamated 
state b. However, since this change only takes effect after time W, it does 
not affect our conclusion here.) 

Thus, as in the proof of Lemma 9 [see the equations leading up to equa- 
tion (10)], we can apply part (a) of Proposition 8 to show that for t suffi- 
ciently large, we have 

(18) P^B t <^^j<eM-ct/(p + l) 2 ), 

for a universal constant c. 

Let p = 64C~ 1 (p + 1) and let a = ^^pjy ■ For integers j > 2, let w(j) = 

-max(j^ T y,a). Define W(l) =0 and for j > 2, let W(j) =T,i=2 w (j)- 
When Mt jumps downward from j to i, where i < j, we define the size 
of the jump by J2k^i+i w {k)- Note that the size of a jump is zero unless 
i < j — 1, and when the size of a jump is nonzero it is a negative number. 
Strictly negative jumps can only occur when a particle ranked either j + 1 
or j + 2 moves to an empty vertex, or when the particle of rank j + 1 moves 
to the vertex containing the particle of rank j + 2. 

Let Jt be the sum of the sizes of all jumps up to time t and let Ut = 
W(Mi) + J t . For t > 0, let T t denote the cr-field generated by {(( s , B s ) : < 
s<t}. 
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Lemma 11. LetY t = Ut — aB t + ^-- Then (Y tA w, Ft) is a submartingale. 



Proof. Suppose that t < W and that the current configuration of Q 
makes M% = x. Let v be the vertex that achieves the maximum in equation 
(17) and let p be the the fraction of vertices in V — {v} that are empty. Note 
that 



B t 
Ut- 



Bt + l, 

U t + w(x + 1) 

U t - w(x), 



at rate at most p 



x + l 



at rate at most (1 — p) 
x — 1 

at rate at least . 



x + l 



Hence, lim e _ -E((U t+e 
'x + l 



aB t+e ) - (U t - aB t )\F t ,t < W) is at least 



V 



x 



+ 



X 



1 



a + (l-p) 
1 



x + l 



max 



1 



x(x + 1) ' 



X 



> 



■ max 



x + l 



x{x — 1) ' 



ft 



max 



x(x + 1) 



,ft + 



■ max 



x(x — 1) 



, a 



This is zero unless a > ? -m , m which case it is at least 

x(x+l) ' 



(19) 



x + l 



a + 



-2ft 



a 



-2a 
P 



Recall that a = ^qr[y- Thus, if ft > x ^+\) > then x > p, so > 
Define Mt = YtAW- The above calculation shows that 

lim e~ 1 E(M +£ - Mt\T u t < W) > 4a/ p - 2a/ p = 2a/ p. 

It is also clear that for some universal constant A > 0, we have 

e~ 1 E(M t+e -M t \Tt,t<W)>0, 

for every t, whenever e < A. Thus, for every t, the discrete-time process 
{Nt+kS '■ k = 0, 1, . . .) is a submartingale whenever d < A. It follows that for 
every s > t, we have 'E(J\f s \J r t) > A/j, since s can be written as t + k5 for some 
positive integer k, and (Nt+k8 - k>0) is a submartingale. It follows that Mt 
is a submartingale. □ 

We will now briefly sketch how Lemma 11 will be used to prove Lemma 
10. First, we will show that aBt grows much faster than ^j- as t — > oo. But 
Lemma 11 and Azuma's inequality imply that YtAW is n °t likely to be much 
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smaller than when t is large. Since Ut < 0, this means that W is not likely 
to be very large. 

More precisely, let T = and for j>l, let Tj = W A inf{i > T.,_i : \Y t - 
Ytj.! I > \}- For > 1, let M fc = Y" Tfe . Then {M k :k>0} is a submartingale 
and |Mfc+i — Mfe| < 1 because the magnitudes of the jumps of Y t are at most 
2- Thus, Azuma's inequality implies that for any constant A > 0, we have 

pYrmnM^ - M < -Aj < e~ bA2/k 

for a universal constant b > 0. Let TVt = inf{fc:Tfc > t A W}. The "drift" 
(viz., -£) and "rate of jumping" [i.e., lim^o e _1 P(^i+e / ^tl-^i)] of Yt are 
uniformly bounded. Thus, for universal constants h, 7 > 0, iVt is stochas- 
tically dominated by jt + X, for a Poisson(/it) random variable X . Thus, 
Proposition 8 implies that P(iVt > 7i + 2ht) < e~ * for sufficiently large t. 
So if we define /3 = 7 + 2/i, then P(iVj > < e~ Aht for sufficiently large t. 
Note that for any e > 0, we have for sufficiently large t, 

P(M Nt - M < -et) <P(N t > (3t) + P (minMj - M < -et 

(20) < e~ Aht + e -K<?t 2 )/W) 

(21) < e~ de2t , 

where d > is a universal constant. Note that Mo = Yq and |Mj\r t — 1*aw| — \ 
for all t. It follows that for every e > we have 

(22) P( W - Y < -et) < e~ deH 

for sufficiently large t, if we incorporate an extra factor of, say, \ into d. 
To complete the proof of Lemma 10, note that 

*(W>t)<P(B t <^) + P(w>t,B t >^ 

(23) 

< e - rf /(p+i) 2 + p( W >t,B t > 



8(p + l) 

where the second inequality uses equation (18). But if W > t, then Y t is 

YtAW, so 

- / Ct 
P[W>t,B t > 



<P(Y tAW <U t + 
= p(Y tAW <U t - 



8(p + l) 

4ta aCi 



p 8( P + i; 

aCt 
16(p+i; 
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<P(r tnw -r <- T ^-Y ), 

where the equality holds because p = 64C~ 1 (p + 1) and the second inequality 
holds because Ut <0- But for sufficiently large t, we have |lo| < 32 "^i) > so 

- / Ct \ - f aCt \ 

-da 2 C 2 t } 
(32)2(^ + 1)2/' 

where the second inequality uses (22). Combining this with (23) completes 
the proof. □ 

APPENDIX 

The following lemma was used in the proof of Lemma 10: 

Lemma A.l. Let Xt be an irreducible continuous-time Markov chain on 
a finite state space S. Fix s > 0, and for x G S, let W x = \ {0 < t < s : X t = x}\ 
be the amount of time the chain spends in state x up to time t. Then there 
is a constant c, independent of s, such that for any event A G o~(W x : x G S), 
and any y G S, we have 

P y {A) < cmaxP z (A), 

where ~P(A) denotes the probability of A when we run the time reversal of 
X t . 

PROOF. Suppose that X t has stationary distribution ir and let c = 
|5| x max Zj j / 7r(2;)/7r(y). Suppose that Xq is distributed according to n. Then 
{Xt : < t < s} has the same distribution as {X s ^t ■ < t < s}, where X is 
the time reversal of X, with Xq having distribution ir. 

Fix an event A G cr(W x :x G S). For states i, j, let Aij = Ad [Xq = i,X s = 
j\. Since {Xt : < t < s} has the same distribution as {X s _t '■ < t < s}, it is 
clear that P(Aij) = P(A,j), for all i and j. Hence, 

p^) = ^p(lK.) 

< -^-maxP (A y tZ ) 



< exp 
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151 

< -^-max7r(z)P(A) 
ir{y) z 

< c max P z (A) . □ 



The following proposition was used in the proof of Lemma 6: 

Proposition A. 2. Let Xt be the continuous-time simple symmetric 
random walk on the integers in which moves in each direction are made 
at rate 1 . Then there is a universal constant b > such that for every r > 1, 
we have P (X t ^0 for all t £ [l,r 2 ]) > br' 1 . 

Proof. Let R = \r\. Since the number of jumps by time 1 is Poisson(2), 
we have 

Po(X\ = 1) > |P(exactly one jump by time 1) = e~ 2 , 

and given that X\ = 1, the conditional probability of hitting R before re- 
turning to is since Xt is a martingale. Call this event B. 

It is well known that for any t > 0, the probability that simple random 
walk deviates to the left of its starting point by more that \ft up to time t can 
be bounded away from (uniformly in t). Hence, given B, the conditional 
probability that X t does not return to by time R 2 is bounded away from 
0. It follows that 

P {X t ^ for all t G [1, R 2 }) > bR" 1 

for a universal constant b. Since R G [r, 2r], the result follows if we incorpo- 
rate an extra factor of | into b. □ 
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